Towards a Calibrated Corpus for Compression Testing

نویسندگان

  • Mark R. Titchener
  • Peter M. Fenwick
  • M. C. Chen
چکیده

A mini-corpus of twelve ‘calibrated’ binary-data files have been produced for systematic evaluation of compression algorithms (Available at http : //www.tcode.auckland.ac.nz/-mark/corpus). These are generated within the framework of a deterministic theory of string complexity [2]. Here the T-complexity of a string z (measured in taugs) is defined as CT(Q) = Ci logz(ki + 1): where the positive integers k, are the T-expansion parameters for the corresponding string production process outlined in [l] . C ( ) T z 1s o bserved in [2] to be the Logarithmic Integral of the total information content Iz, of z (measured in nats), i.e., CT(X) = li(1,). T he average entropy is HZ= I,/jxI, i.e.: the total information content divided by the length of 2. Thus CT(X) = li(H, ~1x1). Alternatively, the information rate along a string may be described by an entropy function Hz(n), 0 < n L: 1x1 for the string [3]. Assuming that Hz(n) is continuously integrable along the length of the 2, then I, = $“’ H,(n)&. T h u s CT(X) = li (JJ”’ H,(n)&). So ving for Hz(n): that is differentiating 1 both sides and rearranging, we get: ~G(4n) Hz(n) = Sn x &I, @-l (w4?J)) (1)

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Development of a compression system dynamic simulation code for testing and designing of anti-surge control system

In recent years, several research activities have been conducted to develop knowledge in analysis, design and optimization of compressor anti-surge control system. Since the anti-surge control testing on a full-scale compressor is limited to possible consequences of failure, and also the experimental facility can be expensive to set up control strategies and logic, design process often involves...

متن کامل

Viscoelastic parameter identification of human brain tissue.

Understanding the constitutive behavior of the human brain is critical to interpret the physical environment during neurodevelopment, neurosurgery, and neurodegeneration. A wide variety of constitutive models has been proposed to characterize the brain at different temporal and spatial scales. Yet, their model parameters are typically calibrated with a single loading mode and fail to predict th...

متن کامل

Finite State Models for the Generation of Large Corpora of Natural Language Texts

Natural languages are probably one of the most common type of input for text processing algorithms. Therefore, it is often desirable to have a large training/testing set of input of this kind, especially when dealing with algorithms tuned for natural language texts. The problem in creating good corpora is that often natural language texts are too short with respect to the dimension required to ...

متن کامل

Attitude of Health Care Professionals Towards Voluntary Counseling and Testing for HIV/AIDS

Introduction: HIV counseling and testing is the vital and preliminary interventional step aimed at reducing the spread of HIV infection. The study was designed to determine the attitude of health care professionals towards voluntary counseling and testing (VCT) for HIV/AIDS at Irrua Specialist Teaching Hospital. Materials & Methods: In this descriptive cross sectional prospective study a sel...

متن کامل

Substructure Model for Concrete Behavior Simulation under Cyclic Multiaxial Loading

This paper proposes a framework for the constitutive model based on the semi-micromechanical aspects of plasticity, including damage progress for simulating behavior of concrete under multiaxial loading. This model is aimed to be used in plastic and fracture analysis of both regular and reinforced concrete structures, for the framework of sample plane crack approach. This model uses multilamina...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999